Software Metrics Reduction for Fault-Proneness Prediction of Software Modules

نویسندگان

  • Yunfeng Luo
  • Kerong Ben
  • Lei Mi
چکیده

It would be valuable to use metrics to identify the fault-proneness of software modules. However, few research works are on how to select appropriate metrics for fault-proneness prediction currently. We conduct a large-scale comparative experiment of nine different software metrics reduction methods over eleven public-domain data sets from the NASA metrics data repository. The Naive Bayes data miner, with a log-filtering preprocessor on the numeric data, is utilized to construct the prediction model. Comparisons are based on the analysis of variance. Our conclusion is that, reduction methods of software metrics are important to build adaptable and robust software faultproneness prediction models. Given our results on Naive Bayes and logfiltering, discrete wavelet transformation outperforms other reduction methods, and correlation-based feature selection with genetic search algorithm and information gain can also obtain better predicted performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Classifiers in Software Fault-Proneness Prediction

Reliability of software counts on its fault-prone modules. This means that the less software consists of fault-prone units the more we may trust it. Therefore, if we are able to predict the number of fault-prone modules of software, it will be possible to judge the software reliability. In predicting software fault-prone modules, one of the contributing features is software metric by which one ...

متن کامل

An Approach to Early Fault Prediction in Software Systems Using K- Means Clustering

Quality of a software component can be measured in terms of fault proneness of data. Quality estimations are made using fault proneness data available from previously developed similar type of projects and the training data consisting of software measurements. To predict faulty modules in software data different techniques have been proposed which includes statistical method, machine learning m...

متن کامل

Evaluation of Fault Proneness of Modules in Open Source Software Systems Using k-NN Clustering

Fault-proneness of a software module is the probability that the module contains faults. A correlation exists between the fault-proneness of the software and the measurable attributes of the code (i.e. the static metrics) and of the testing (i.e. the dynamic metrics). Early detection of fault-prone software components enables verification experts to concentrate their time and resources on the p...

متن کامل

Software Fault-proneness Prediction using Module Severity Metrics

Most of the fault prediction studies have focused on the binary classification models that determine whether the input modules are fault-prone or not. More recently, several studies have shown that severity-based multi-classification models are more useful since they can predict the fault-proneness depending on the severity of the defects in the module. We present new severity-based prediction ...

متن کامل

A Fault Detection Strategy for Software Projects

Original scientific paper The existing software fault prediction models require metrics and fault data belonging to previous software versions or similar software projects. However, there are cases when previous fault data are not present, such as a software company’s transition to a new project domain. In this kind of situations, supervised learning methods using fault labels cannot be applied...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010